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Let (Ui) be a sequence of i.i.d. random variables uniformly distributed on [0, 1], 
and let Fn be the so-called cumulative empirical function, associated with the 
n first Ui's: 

F„(i) :=n-i#{fZ,<<,ze {!,..., n}}, [0,1]. 

The sequence of processes (F„) converges pointwise a.s. on [0,1] to F defined by 
F{t) = t; this is a consequence of the strong law of large numbers. The Glivenko- 
Cantelli theorem asserts that this a.s. convergence stands also for the uniform 
convergence: a.s. sup^^rQ j^i \Fn{x) — F{x)\ 0. To see this, take ^ xi < ■ ■ ■ < 

Xfc = 1 and check that by monotonicity of F^ and F, sup^gjQ ^ \Fn{x) — F{x)\ < 

maxj max(|F„(a:;j+i) — F{xj)\, \Fn{xj) — F{xj^i)\) ^ maxxj+i — xj, which 

may be chosen as small as wanted. 

In some sense, Donsker's Theorem [1] provides the second term in this con- 
vergence. Consider 

bnit):=V^iFn{t)^F{t)), te[0,l]. (1) 



Theorem 1 (Donsker [l]) The sequence (b„) converges in distribution to the 
Brownian bridge b on D[0, 1] the space of cddldg functions on [0,1] equipped 
with the Skorohod topology. 

Note 1 When the variables Ui 's are not uniform, the study of the empirical 
process reduces to the uniform case thanks to a classical "time change" involving 
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the inverse of the cumulative function of the Ui 's. Some problems of continuity 
arise due to the atoms of the Ui 's but roughly speaking one may say that all the 
difficulties are present in the case of the uniform distribution. 

We recall that the Brownian bridge is the continuous centered Gaussian pro- 
cess such that cov(b(s), b{t)) = s(l — t) when < s < t < 1. It owns the 
following trajectorial representation : 

(b(i))te[o,i] = (Bt-iBi),e[oa], (2) 

where B is the standard Brownian motion. This may immediately be checked 
using that B is a centered Gaussian process such that cov(Bs,Bt) = min(s,t). 

In fact, Donsker proves only in details max bn -^-^ max b justifying the Doob's 

n 

heuristic [3] . One may find in the literature numerous more or less direct proofs 
of Theorem 1. See e.g. Billingsley [2] (and references therein), Kallenberg [4], 
and also more advanced proofs and constructions (and stronger results) as that 
of Komlos, Major and Tusnady ['>]. Some books are devoted to the convergence 
of empirical measures and processes : we send the interested reader to Shorack 
& Wellner [()], van der Vaart & Wellner [7]. As a matter of fact, usual proofs of 
Theorem 1 use often some advanced constructions or are treated in probability 
books when a lot of materials have been introduced, leading to some intricate 
and complex proofs, quite difficult to be taught entirely to beginners. The aim of 
this paper is to present a new proof of Theorem 1 using only "simple" arguments: 
only immediate considerations about the weak convergence in C[0, 1] and D[0, 1], 
and the other very famous Donsker's Theorem which says that a rescaled random 
walk converges to the Brownian motion are used. The Appendix recalls this 
material. 

We begin the proof of Theorem 1 following the steps of Donsker [J]. We 
say that a random vector (Mi)i=i....^„ has the multinomial distribution with 
parameters (A:,pi, . . . ,p„), we write (Afi)i=i^....„ ~ mult(A:,pi, . . . ,p„), when 
F{Mi = mi,i = 1, . . . ,n) = y^"'^' „^ i Yli=i pT' prescribed non negative 

integers mi, . . . , summing to fc, and otherwise. 
The vector (iVj)j=i_..._„ defined by 

Nj ■.= #{^e{l,...,n},U^e[i:j-^)/n,J/n]}, 

has the mult(77, . . . , 1/n) distribution. The empirical process taken at time 
k/n for k G {0, . . . , n} is a simple function of this vector : 

A: 

b„{k/n) = V^iF„{k/n) - F{k/n)) = n-^'^Y.^Nj - 1). (3) 

Let b„ be the process obtained by interpolating b„ between the points {fc/?i, k G 
{0,...,n}}. 
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Let (Pfe) be a sequence of i.d.d. Poisson random variables with parame- 
ter 1. The distribution of {Pk)k=i n under the condition Pk = n (or 

X]fe=i(-Pfe ~ 1) = 0) is also mult(n, 1/n, . . . , 1/n) as can be straightforwardly 
checked. For any fc e {0, . . . , n}, set 

k 

S.=E(^^-1) (4) 

and let S = {Sk)k=Q,...,n be the "centered" Poisson random walk, interpolated 
between integer points. Hence, we have 

(b„(t))tgro.ii = fn"^/^S„i) conditioned by S„ = 0, (5) 

V ^te[o,i] 

and 

sup iMt) - b„(t)| < max Nk- (6) 

tG[0,l] fc=l,...,n 

This is controlled as follow: the iV^'s are Binomial(n, 1/n). By the Markov 
inequality write P(maxfc 

nil + ^Ye-"^ ~ ne'^-'^e-'^ 0, and then 

n ' max Aife > 0. (7) 

/£=l,...,n n 

Hence, by (5), (6) and (7), (see also Lemma 7 in Appendix) Theorem 1 stat- 
ing the convergence of (b„) to b in 13 [0,1] is easily implied by the following 
proposition. 

Proposition 2 The sequence (?T.~^^^S„t)^gjjj conditioned 6?/ S„ = converges 

in distribution to b in C[0, 1] equipped with the topology of the uniform conver- 
gence. 

The proof we propose for this classical proposition is the real novelty of this 
paper. 



The "correction" of a Poisson random walk 

The main line in our approach is the comparison between S and S conditioned 
by S„ = 0. We introduce a correcting process C = (Cfc)fc=o,...,n, such that the 
pair (S, C) have the following feature : 

• S is the centered Poisson random walk (defined in (4)), 

• S -|- C is distributed as S conditioned by S„ = 0. 

Note 2 Transforming a problem involving n random variables into a problem 
involving /„ ~ Poisson(n) random variables is called Poissonization. Taking 
Ui, . . . , Ui^ instead of Ui, . . . ,Un in the construction presented at the beginning 
of the paper amounts to replacing S conditioned &?/ S„ = by the centered 
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Poisson random walk S. This is a Poissonization. The correction of the Poisson 
random walk we propose, which allows to pass from S to S conditioned by Sn = 
is from our point of view different in nature from the usual depoissonization 
techniques. Here, everything relies on an exact combinatoral correction, when 
usually, most rely on the convergence in distribution of {In —n) / \/n, ensuring the 
problem with n variables and with /„ variables being asymptotically equivalent, 
which is not the case here. 

Let us come back to our correction procedure. To fix the details, we will use 
a classical interpretation of Poisson random walk in term of urns/balls. 
Conditionally on S„ = s, the vector (Pi)i=i,...,n has the mult(s+?i, . . . , 1/n) 
law. When m balls labeled 1, . . . , m are sent independently in n urns according 
to the uniform distribution, the vector {N'i)i=i,,,,^n giving the number of balls 
in the urns follows also the mult(m, 1/n, . . . , 1/n) distribution. 

Let us throw Pi balls in urn i where {Pi)i=i....^n are i.i.d. Poisson random 
variables with parameter 1. Then three cases arise: Yll^=i Pi 
or X^Li Pi> (recall that S„ = X^Li Pi ~ 

In the first case S„ = 0, no correction are necessary, then set = for any i. 
The two last cases are treated below. Notice that we focus on the uni-dimensional 
distributions of the process C since this will appear to be sufficient. 

Case S„ < 0. We work conditionally on S„ = s. Since — s balls arc lacking: 
throw —s new balls and denote by the number of new balls fallen in the k 
first urns; for any fc. 



More precisely, (ACA:)fe=i,...,ri ^ mult(— s, 1/n, . . . , 1/n) where ACfc := Cfe — 
Cfe_i is the fcth increment of the correcting process C (with Cq = 0). 

Lemma 3 For any s < and any n> 1, conditionally on — s the process 
S + C is distributed as S conditioned by Sn ~ and ^ Binomial{~s, k/n). 

Proof. If X~mult(n + s,l/n, ...,l/n),y~ (-s, 1/n, . . . , 1/n) and X and F 
are independent then X + Y ^ mult(n, 1/n, ... , 1/n). □ 

Case S„ > 0. We work conditionally on Sn = s. In this case n + s balls 
have been thrown instead of n and then s balls must be taken out. The vector 
{Vk)k=i,....n giving the exceeding number of balls in the different urns (those 
with labels in n + 1, . . . ,n + s) follows the law mult(s, 1/n, • • • , 1/n). Then given 

S„ = s, we search a correcting process (ACk)A;=i,....ri == (— Vfc)fc=i,....„. Of 
course there is a problem to define the correcting process in terms of balls/urns, 
the balls/urns problem living a priori on a larger probability space than the 
(Pi)'s. But this gives us the intuition for a right correcting process: we define 
C conditionally on the P^'s as follows. Let (pi)i=i,....n be non negative integers 
summing to n + s. Set 



P (ACfe = -Cfe, fc e {1, . . . , n}|P, =p,,i€ {1, . . . ,n}) = ^^'-i^-^ (g) 



Cfe ^ Binomial(— s, k/n). 



(8) 



n 
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for any given non negative integers Ci, . . . , c„ summing to s, and otherwise. 

Lemma 4 For any s > and any n> 1, conditionally on — s the process 
S + C is distributed as S conditioned by Sn ~ and ^ —Binomial{s, k/n). 

Proof. We have to check that C + S is distributed as S conditioned by S„ =0: 

V- P(AC, ^ -(k- jO,V^|P, =p„V^) P(P, ^p„Vz) 

P(S„=s) 

(Pi) : }_^Pi=n+s,pi>ji 

fn+s\ e-"n"+° 

= J 1"/ f =P(^,=Ji,Vz|S„ = 0) 
e "■n'yni 

where we have used (9), the fact that n + S„ is Poisson(7i) distributed, and 



E n^ = (i + --- + ir = -^- (10) 

We now show that knowing S„ = s, {—ACk)k=i,...,n ^ muh(s, . . . , 1/n). 
This imphes the second point. Let ci, . . . , c„ be non negative integers summing 
to s. Write P(ACfe = -Cfe,Vfc|S„ = s) 



E 



By (10), S,i + n ~Poisson(7i) and (9), this is easily shown to be equal to 

From now on, consider the process C as being interpolated between integer 
points. 

Lemma 5 For any t E [0, 1], 

n-i/2|c„,+tS„|^^0. (11) 

n 

Proof. We work with C[„tj instead of Cut- For t — 0, (11) holds clearly. Let 
t e (0, 1], and e > be fixed. Write 

P(|Cl„,j + iS„| > eVT^) < V^' + Wi' 

where 

V^' = P(|CL„tj+iS„|>£V^,|S„|eV^[M-\M]), 
W^„^^ = P(|S„|^ V^[Af-\Af]). 
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Let a > be a fixed (small) positive real number. The central limit theorem 
applied to S„ ensures that there exists M such that < a, for n large enough. 
Fix this M. To bound V*^ use F{A\ Ui Bi) = Y,^{A\Bi)f{B,) / P{\jBi) < 
maxiP(A|i?i) for disjoint sets Bi. Hence 

V^' < max P(|CL„tj +tS„| > £07 | |S„| = fc). 

k,\k\e^[M-^,M] 

By Lemmas 3 and 4, P(|CL„.fj +tS„| > e^" I |Sri| = k) = P(|B(fc, \nt\/n)-tk\ > 
£\pn) where [ntj/n) is a binomial random variable with parameters fc and 
\nt\ln. Further, by the Bienaymc-Tchebichev inequality 

max P(|B(fc, \ nt\ /n) - tk\ > ey^) 0. □ 

k,\k\e^[M-^ ,M] 

Proposition 6 (i) The following convergence holds in C([0, 1],M^); 

n 

(ii) The following convergence holds in C[0, 1]: 

n~^^^{Snt + C„t)tgro,ii (Bt - iBi)tgro,i]. 

n 

Proposition 2 is a consequence of (ii) thanks to Lemmas 3 and 4. 
Proof. Assertion (ii) is a consequence of (i). Proof of (i) : the convergence of 
j^-i/2g^ to B in C[0, 1] is given by the other famous Donsker's theorem stating 
the convergence of rescaled random walks to the Brownian motion (see [2] or 
[4]). In particular 

n~i/2S„^Bi. (12) 

The finite dimensional distribution of n~^/^C„. converges to those of the process 
(tBi)(g[o.i]- Indeed by Lemma 5 and (12), 

n""^/^(S„, C„ij, . . . , Cntk) (Bi, — tiBi, . . . , — tfeBi) 

n 

for any < ti < • • • < < 1. Then the family (n^^^^Cn.) is tight since 
it is a sequence of monotone processes whose finite dimensional distribution 
converge to those of the a.s. continuous process (tBi)tg[o,i] (this is Lemma 
8{ii)). Hence the family n~-'^/^(S,i., C„.) is tight. The limit is identified again 
thanks to Lemma 5. □ 

Conclusion 

The idea of this proof appeared after a discussion with Philippe Duchon, where 
he explained me his algorithm to generate uniformly a Bernoulli bridge with 2n 
steps, that is a random walk S = {Sk)k=o,...,2n with increments ±1, conditioned 
by S2n = : build first a simple random walk with 2n steps, choosing i.i.d. 
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increments 4-l,or —1 with probability 1/2. If S2n = then it's done. If not, 
assume that S2n = 2A: > 0. Then pick up at random indices /i, /2, . . . in |1, 2n]. 
If li is the index of a positive increment, change it into a negative one; if it is 
negative then do nothing. Stop when you have changed k increments. By a simple 
symmetry argument the path obtained is uniform in the set of Bernoulli bridges 
of size 2n. I found that this was a nice way to prove that rescaled Bernoulli bridge 
converges to the Brownian bridge; this can be proved using the same argument 
than the ones exposed above: the correction procedure will asymptotically and 
"eventually removes a straight line of the Brownian motion" . Therefore, I tried 
to find other increment distributions for which a similar correction procedure 
would have been possible. It appears to be not so general, or at least, not so 
agreeable. The problem is the following one: in general there does not exist any 
simple correction procedure that conserves at each step of the correction the 
property of the trajectory to have conditionally on its terminal position k, the 
law of a simple random walk conditioned by S„ = fc. 



Appendix 

A simple link between the convergence in C[0, 1] and in D[Q,1\ 

Lemma 7 Let (X„) he a sequence of processes taking their values in £'[0,1]. 
Assume that for any n, Xn = y„ + Z„ where Yn is a continuous process, and 
Zn is a cadlag process. If {Yn) converges in distribution to Y in C[0, 1], and if 

sup \Zn\ -^-^ then {Xn) converges in distribution to Y in D[Q, 1]. 

n 

Proof. As a matter of fact, this statement is easy if one knows: 
(a) {Yn) ^ y in C[0, 1] implies (r„) ^ Y in D[Q, 1], 

n n 

{b) {Zn) cadlag, sup |Z„| ^ implies Zn -^-^ (the null process) on £'[0, 1]. 

n n 

Indeed, knowing this. Lemma 7 is a consequence of the following classical result: 
let An, Bn,Cn bc random variables in a metric space {S,p). If An = Bn + C„, 

An -^-^ A, p{Bn) -^-^ then An + Bn -^-^ A (sec e.g. Billinglsey [_, Theorem 

n n n 

3.1]). 

In order to understand why (a) and (&) hold true, recall that the Skhorohod 
topology on D[0, 1] is defined by a metric: let A be the set of strictly increasing 
continuous functions A, satisfying A(0) = 0, A(l) = 1. The metric is 

4(/,.g) = inf I sup{|A(i) -t\,te[0, 1]} V sup{|/(A(0) - git)lt e [0, 1]}}. 



Hence ds{f,g) < sup{|/(i) — g{t)\,t e [0,1]}; this yields immediately to (a) 
and (6). □ 
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Convergence in C[0, 1] and in C[0, 1]^ 

We recall some classical facts concerning the weak convergence in C[0, 1] and 
(C[0, 1])^. First tightness and relative compactness are equivalent in these sets 
by Prohorov's theorem, since they are both Polish spaces. 

Lemma 8 {i)Let (X„,y„) be a sequence of pairs of processes in (C[0, 1])^. The 
tightnesses of both families (Xn) and (1^) imply that o/(X„,y„). 
(ii) Let (Xn) be a sequence of monotone processes in C[0, 1]. // the finite di- 
mensional distributions of (Xn) converge to those of an a.s. continuous process 

X, then (Xn) is tight and then Xn -^-^ X in C[0, 1]. 

n 

Proof, (i) Take two compacts Ki and K2 of C[0, 1] such that P(X„ & Ki) > 
1-e and P(r„ € X2) > 1 - then P((X„, y„) e K1XK2) >l-2e and Ki x K2 
is compact in (C[0, 1])^. 

{ii) Only the tightness of (X„) in C[0, 1] has to be checked. For any function 
/ : [0, 1] M, and S > 0, the global modulus of continuity of / is 

Losif) = sM\f{x) - fiy)\,x,y e [0,l],\x - y\ < S}. 

Since X„ is increasing, for any positive integer to. 



i^i/m{Xn) < Ajn,n := 2 max | 



x„(-)-x„(^) 

TO TO 



1, 



Since the finite dimensional distributions of {X„) converge to those of X , 
(d) 



Am..n. ^ Am. '• — 2 maX 



TO 771 



and by the uniform continuity of X, A„i ^l^^' Q. Hence lim lim sup V{uii /,„ {Xn) > 

m 7n n 

e) = for any e > 0. □ 
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